Minimizing Syntactic Dependency Lengths: Typological/Cognitive Universal?
Abstract
Syntactic dependencies are head/modifier relations between the words of a sentence; together they organize the sentence into a syntactic tree. The general principle that languages prefer to keep syntactically related words close together can be made precise as a preference for shorter dependencies, known as dependency length minimization (DLM). We examine evidence for this principle both in the development of languages' grammars and in the choices individual speakers make where syntactic variation is licensed. We survey evidence from corpus studies, computational simulations, and comprehension experiments; taken together, this evidence makes a compelling case for DLM as an important factor in language structure and cognition.
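To illustrate how a preference for shorter dependencies can be quantified, the sketch below sums the distance between each word and its head. It assumes that length is measured as the difference in word positions, which is one common convention and not necessarily the measure used in the paper. The function name, the example sentences, and their head assignments are hypothetical and chosen only for illustration.

```python
from typing import Sequence


def total_dependency_length(heads: Sequence[int]) -> int:
    """Sum of |position - head position| over all non-root words.

    heads[i] is the 1-based position of the head of word i + 1;
    a value of 0 marks the root, which has no incoming dependency.
    """
    total = 0
    for position, head in enumerate(heads, start=1):
        if head == 0:
            continue  # the root contributes no dependency length
        total += abs(position - head)
    return total


# Illustrative (assumed) analyses of two orderings of the same words.
# "John threw out the trash": out -> threw, the -> trash, trash -> threw
particle_adjacent = [2, 0, 2, 5, 2]
# "John threw the trash out": the -> trash, trash -> threw, out -> threw
particle_shifted = [2, 0, 4, 2, 2]

print(total_dependency_length(particle_adjacent))  # 6
print(total_dependency_length(particle_shifted))   # 7
```

Under these assumed analyses the ordering that keeps the particle next to the verb has the smaller total, and the gap widens as the object noun phrase grows longer.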
Similar resources
Linguistic Typology meets Universal Dependencies
Current work on universal dependency schemes in NLP does not make reference to the extensive typological research on language universals, but could benefit since many principles are shared between the two enterprises. We propose a revision of the syntactic dependencies in the Universal Dependencies scheme (Nivre et al. [16, 17]) based on four principles derived from contemporary typological the...
An annotation scheme for Persian based on Autonomous Phrases Theory and Universal Dependencies
A treebank is a corpus with linguistic annotations above the level of the parts of speech. During the first half of the present decade, three treebanks have been developed for Persian either originally or subsequently based on dependency grammar: Persian Treebank (PerTreeBank), Persian Syntactic Dependency Treebank, and Uppsala Persian Dependency Treebank (UPDT). The syntactic analysis of a sen...
Adventures in Multilingual Parsing
The typological diversity of the world’s languages poses important challenges for the techniques used in machine translation, syntactic parsing and other areas of natural language processing. Statistical models developed and tuned for English do not necessarily perform well for richly inflected languages, where larger morphological paradigms and more flexible word order give rise to data spars...
Briefly Noted
Seeking to develop the first satisfying universal definition of basic grammatical units of the major syntactic categories such as "noun," "verb," and "adjective" and of the principal grammatical relations, "subject," "object," and "oblique," William Croft employs the methods of cross-linguistic or typological analysis and cognitive linguistics. Drawing evidence from a wide variety of languages,...
Universal Reordering via Linguistic Typology
In this paper we explore the novel idea of building a single universal reordering model from English to a large number of target languages. To build this model we exploit typological features of word order for a large number of target languages together with source (English) syntactic features and we train this model on a single combined parallel corpus representing all (22) involved language p...